📉 Model Quantization - minezone · Scour

From scikit-learn to Production, Deploying ML Models That Actually Work

dev.to·12h·

Discuss: DEV

Is anyone compressing AI models for the 4B people without GPUs or internet?

news.ycombinator.com·22h·

Discuss: Hacker News

Optimizing Recommendation Systems with JDK’s Vector API

netflixtechblog.com·1h

🗂️Vector Databases

Qwen 3.5 9B, 4B models beating 30B, 80B models

huggingface.co·7h·

Discuss: Hacker News

🚀Performance

The Architecture Behind Open-Source LLMs

blog.bytebytego.com·10h

🧩LLM Integration

Accelerating Sonar Through Speculation

research.perplexity.ai·5h

🧩LLM Integration

Learning What Will Happen Next: Predictive Coding in Hyperspace

blog.brojo.ai·22h·

Discuss: Hacker News

Skin Health at the Edge: Real-time Lesion Screening with MediaPipe and TensorFlow.js 🩺✨

dev.to·1h·

Discuss: DEV

LLM Edge Predictive Maintenance — AI Agents for Industrial Vibration Diagnostics

lgdimaggio.github.io·5h·

Discuss: Hacker News

💡Observability on a Budget

mradermacher/Qwen3.5-122B-A10B-heretic-GGUF

huggingface.co·10h·

Discuss: r/LocalLLaMA

🏔️Alpine.js

How I automate mcp integration recipes cookbook for AI agent workflows

jamiesupply.gumroad.com·9h·

Discuss: DEV

💬Prompt Engineering

Understanding Rope: From Rotary Embeddings to Context Extension

mli0603.notion.site·17h·

Discuss: Hacker News

In-context learning of representations can be explained by induction circuits

lesswrong.com·3h

♿Web Accessibility

Edge AI: Not Just a Cost Issue, But a Power Issue

guanjiawei.ai·12h·

Discuss: DEV

The AI Efficiency Survey

sambanova.ai·23h

💬Prompt Engineering

memvector/ext-memvector: MemVector — Local Vector Database & Embedding Engine for PHP

github.com·1d·

Discuss: Hacker News

🧲Vector Search & Embeddings

March 5 - AI, ML and Computer Vision Meetup

voxel51.com·8h·

Discuss: DEV

Detecting LLM-Generated Web Novels Using "Classical" Machine Learning (AIGC Text Detection)

blog.lyc8503.net·1d·

Discuss: Lobsters

Optimal Heterogeneous Memory Configs for AI Tasks Under Specified Performance Metrics (Stanford, UCSC)

semiengineering.com·1d

⚡Cache Optimization

Fast Autoscheduling for Sparse ML Frameworks

fredrikbk.com·2d·

Discuss: Hacker News

Loading more...